Abstract

We present a parser for Abstract Meaning Representation (AMR). We treat
English-to-AMR conversion within the framework of string-to-tree,
syntax-based machine translation (SBMT). To make this work, we transform
the AMR structure into a form suitable for the mechanics of SBMT and
useful for modeling. We introduce an AMR-specific language model and add
data and features drawn from semantic resources. Our resulting AMR parser
improves upon state-of-the-art results by 7 Smatch points.

1 Introduction

The soldier was not afraid of dying.
The soldier was not afraid to die.
The soldier did not fear death.

Figure 1: An Abstract Meaning Representation (AMR) with several English
renderings.

Abstract Meaning Representation (AMR) is a compact, readable,
whole-sentence semantic annotation (Banarescu et al., 2013). It includes
entity identification and typing, PropBank semantic roles (Kingsbury and
Palmer, 2002), individual entities playing multiple roles, as well as
treatments of modality, negation, etc. AMR abstracts in numerous ways,
e.g., by assigning the same conceptual structure to fear (v), fear (n),
and afraid (adj). Figure 1 gives an example of an AMR with several English
renderings.

In this paper, we automatically learn how to parse English into AMR, by
exploiting a publicly available corpus of more than 10,000 English/AMR
pairs.^1

The AMR parsing problem bears a strong formal resemblance to syntax-based
machine translation (SBMT) of the string-to-tree variety, as shown in
Figure 2. Because of this, it is appealing to consider whether we can
apply the substantial body of techniques already invented for SBMT.^2

^1 LDC Catalog number 2014T12

^2 See e.g. the related work section of Huck et al. (2014).


Figure 2: Similarities between AMR parsing and syntax-based machine
translation (SBMT).

However, there are also some major differences, as shown in Figure 3.
Therefore, applying SBMT to AMR parsing requires novel representations and
techniques, which we develop in this paper. Some of our key ideas include:

1. Introducing an AMR-equivalent representation that is suitable for
string-to-tree SBMT rule extraction and decoding.

2. Proposing a target-side reordering technique that takes unique
advantage of the fact that child nodes in AMR are unordered.

3. Introducing an AMR-specific language model.

4. Developing tuning methods that maximize Smatch (Cai and Knight, 2013).

5. Integrating several semantic knowledge sources into the AMR parsing
task.

These key ideas lead to state-of-the-art AMR parsing results. We next
describe our baseline SBMT system, and then we adapt it to AMR parsing.

Figure 3: Differences between AMR parsing and SBMT.

2 Syntax-Based Machine Translation 

Our baseline SBMT system proceeds as follows. Given a corpus of (source
string, target tree, source-target word alignment) sentence translation
training tuples and a corpus of (source, target, score) sentence
translation tuning tuples:

1. Rule extraction: A grammar of string-to-tree rules is induced from
training tuples using the GHKM algorithm (Galley et al., 2004; Galley et
al., 2006).

2. Local feature calculation: Statistical and indicator features, as
described by Chiang et al. (2009), are calculated over the rule grammar.

3. Language model calculation: A Kneser-Ney-interpolated 5-gram language
model (Chen and Goodman, 1996) is learned from the yield of the target
training trees.

4. Decoding: A beamed bottom-up chart decoder calculates the optimal
derivations given a source string and feature parameter set.

5. Tuning: Feature parameters are optimized using the MIRA learning
approach (Chiang et al., 2009) to maximize the objective, typically Bleu
(Papineni et al., 2002), associated with a tuning corpus.

We initially use this system with no modifications and pretend that
English-AMR is a language pair indistinct from any other.

3 Data and Comparisons 

We use English-AMR data from the AMR 1.0 corpus, LDC Catalog number
2014T12. In contrast to narrow-domain data sources that are often used in
work related to semantic parsing (Price, 1990; Zelle, 1995; Kuhlmann et
al., 2004), the AMR corpus covers a broad range of news and web forum
data. We use the training, development, and test splits specified in the
AMR corpus (Table 1). The training set is used for rule extraction,
language modeling, and statistical rule feature calculation. The
development set is used both for parameter optimization and qualitatively
for hill-climbing. The test set is held out blind for evaluation. We
preprocess the English with a simple rule-based tokenizer and, except
where noted, lowercase all data. We obtain English-AMR alignments by using
the unsupervised alignment approach of Pourdamghani et al. (2014). All
parsing results reported in this work are obtained with the Smatch 1.0
software (Cai and Knight, 2013). We compare our results to those of
Flanigan et al. (2014) on the AMR 1.0 data splits; we run that work’s JAMR
software according to the provided instructions.^3

Corpus          Lines     Tokens
Training       10,313    218,021
Development     1,368     29,484
Test            1,371     30,263

Table 1: Data splits of AMR 1.0, used in this work. Tokens are English,
after tokenization.

4 AMR Transformations 

In this section we discuss various transformations to our AMR data.
Initially, we concern ourselves with converting AMR into a form that is
amenable to GHKM rule extraction and string-to-tree decoding. We then turn
to structural transformations designed to improve system performance.
Figure 4 progressively shows all the transformations described in this
section; the example we follow is shown in its original form in Figure 4a.

4.1 Massaging AMRs into Syntax-Style Trees 

The relationships in AMR form a Directed Acyclic Graph (DAG), but GHKM
requires a tree, so we must begin our transformations by discarding some
information. We arbitrarily disconnect all but a single parent from each
node (see Figure 4b). This is the only lossy modification we make to our
AMR data. As multi-parent relationships occur approximately once per
training sentence, this is indeed a regrettable loss. We nevertheless make
this modification, since it allows us to use the rest of our
string-to-tree tools.
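
As an illustration only, the parent-disconnection step can be sketched as follows, assuming the AMR is given as a list of (parent, role, child) triples over variable names. The function name, the keep-the-first-parent policy, and the use of a '*' placeholder for the disconnected referent are assumptions of this sketch; the text above only states that the choice of surviving parent is arbitrary.

```python
def disconnect_extra_parents(edges):
    """Turn a rooted DAG into a tree by keeping one incoming edge per node.

    edges: list of (parent, role, child) triples over variable names.
    Returns (kept, dropped): `kept` is a tree in which every re-entrant
    reference has been replaced by the placeholder "*" (an assumption of
    this sketch); `dropped` lists the original re-entrant edges.
    """
    seen = set()
    kept, dropped = [], []
    for parent, role, child in edges:
        if child in seen:
            # This node already has a parent: cut the link, keep the role.
            kept.append((parent, role, "*"))
            dropped.append((parent, role, child))
        else:
            seen.add(child)
            kept.append((parent, role, child))
    return kept, dropped


# "The soldier was not afraid of dying": the soldier fills a role of both
# the fear instance (f) and the die instance (d).
amr_edges = [
    ("f", "ARG0", "s"),
    ("f", "ARG1", "d"),
    ("d", "ARG1", "s"),  # second parent of s
]
tree_edges, lost = disconnect_extra_parents(amr_edges)
```

Here `lost` holds exactly the information that the lossy step discards, which makes it easy to count how often re-entrancy occurs in a corpus.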

AMR also contains labeled edges, unlike the constituent parse trees we are
used to working with in SBMT. These labeled edges have informative content
and we would like to use the alignment procedure of Pourdamghani et al.
(2014), which aligns words to edges as well as to terminal nodes. So that
our AMR trees are compatible with both our desired alignment approach and
our desired rule extraction approach, we propagate edge labels to
terminals via the following procedure:

^3 https://github.com/jflanigan/jamr

[Figure 4 panels: (a) The original AMR. (b) Disconnecting multiple parents
of (a). (c) Edge labels of (b) pushed to leaves, preterminals added.
(d) Restructuring (c) with concept labels as intermediates.
(e) Restructuring (c) with role labels as intermediates. (f) String
preterminal relabeling of (c). (g) Original alignment of English to (c).
(h) After reordering of (g). (i) Restructured, relabeled, and reordered
tree: (e), (f), and (h).]

Figure 4: Transformation of AMR into tree structure that is acceptable to
GHKM (Galley et al., 2004; Galley et al., 2006) rule extraction and yields
good performance.

1. For each node n in the AMR tree we create a corresponding node m with
the all-purpose symbol ‘X’ in the SBMT-like tree. Outgoing edges from n
come in two flavors: concept edges, labeled ‘Inst’, which connect n to a
terminal concept such as fear-01, and role edges, which have a variety of
labels such as ARG0 and name, and connect n to another instance or to a
string.^4 A node has one instance edge and zero or more role edges. We
consider each type of edge separately.

2. For each outgoing role edge we insert two unlabeled edges into the
corresponding transformation; the first is an edge from m to a terminal
bearing the original edge’s role label (a so-called role label edge), and
the second (a role filler edge) connects m to the transformation of the
original edge’s target node, which we process recursively. String targets
of a role receive an ‘X’ preterminal to be consistent with the form of
role filler edges.

3. For the outgoing concept edge we insert an unlabeled edge connecting m
and the concept. It is unambiguous to determine which of m’s edges is the
concept edge and which edges constitute role label edges and their
corresponding role filler edges, as long as paired label and filler edges
are adjacent.

4. Since SBMT expects trees with preterminals, we simply replicate the
label identities of concepts and role labels, adding a marker (‘P’ in
Figure 4) to distinguish preterminals.

^4 In Figure 4 the negative polarity marker is a string. Disconnected
referents labeled ‘*’ are treated as AMR instances with no roles.

The complete transformation can be seen in Figure 4c. Apart from multiple
parent ancestry, the original AMR can be reconstructed deterministically
from this SBMT-compliant rewrite.
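
The four steps above can be sketched as a short recursive function. This is a minimal illustration, assuming an AMR tree instance is stored as a dict with a "concept" string and a list of (role, target) pairs, where a target is either another instance or a plain string. The dict layout, the tuple encoding of output trees, and the function names are assumptions of the sketch, not part of the described system.

```python
def to_sbmt_tree(inst):
    """Convert an AMR tree instance into an SBMT-style tree.

    Output trees are tuples (label, child, ...); terminals are strings.
    """
    # Steps 3 and 4: the concept edge becomes a 'P'-marked preterminal.
    children = [("P" + inst["concept"], inst["concept"])]
    for role, target in inst["roles"]:
        # Steps 2 and 4: role label edge, as a 'P'-marked preterminal ...
        children.append(("P" + role, role))
        # ... immediately followed by its role filler edge.
        if isinstance(target, str):
            children.append(("X", target))  # string target gets an 'X' preterminal
        else:
            children.append(to_sbmt_tree(target))
    # Step 1: every instance maps to a node with the all-purpose symbol 'X'.
    return ("X", *children)


def tree_yield(tree):
    """Left-to-right terminals of a tuple-encoded tree."""
    if isinstance(tree, str):
        return [tree]
    return [tok for child in tree[1:] for tok in tree_yield(child)]


# The running example after parent disconnection; "*" is the referent
# instance with no roles.
amr = {
    "concept": "fear-01",
    "roles": [
        ("ARG0", {"concept": "soldier", "roles": []}),
        ("ARG1", {"concept": "die-01",
                  "roles": [("ARG1", {"concept": "*", "roles": []})]}),
        ("polarity", "-"),
    ],
}
sbmt_tree = to_sbmt_tree(amr)
terminals = tree_yield(sbmt_tree)
```

Because each role label preterminal is emitted directly before its filler, the adjacency condition of step 3 holds by construction, which is what makes the rewrite invertible.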

4.2 Tree Restructuring 

While the transformation in Figure 4c is acceptable to GHKM, and hence an
entire end-to-end AMR parser may now be built with SBMT tools, we do not
believe the resulting parser will exhibit very good performance. The trees
we are learning on are exceedingly flat, and thus yield rules that do not
generalize sufficiently. Rules produced from the top of the tree in
Figure 4c, such as that in Figure 5a, are only appropriate for cases where
fear-01 has exactly three roles: ARG0 (agent), ARG1 (patient), and
polarity.

We follow the lead of Wang et al. (2010), who in turn were influenced by
similar approaches in monolingual parsing (Collins, 1997; Charniak, 2000),
and re-structure trees at nodes with more than three children (i.e.
instances with more than one role), to allow generalization of flat
structures.

However, our trees are unlike syntactic constituent trees in that they do
not have labeled nonterminal nodes, so we have no natural choice of an
intermediate (“bar”) label. We must choose a meaningful label to
characterize an instance and its roles. We initially choose the concept
label, resulting in trees like that in Figure 4d. However, this attempt at
re-structuring yields rules like that in Figure 5b, which are general in
form but are tied to the concept context in which they were extracted.
This leads to many redundant rules and blows up the nonterminal vocabulary
size to approximately 8,000, the size of the concept vocabulary.
Furthermore, the rules elicited by this procedure encourage undesirable
behavior such as the immediate juxtaposition of two rules generating ARG1.
We next consider restructuring with the immediately dominant role labels,
resulting in trees like that in Figure 4e and rules like that in
Figure 5c. This approach leads to more useful rules with fewer undesirable
properties.

4.3 Tree Relabeling 

AMR strings have an effective preterminal label of ‘X,’ which allows them
to compete with full AMR instances at decode time. However, whether a role
is filled by a string or an instance is highly dependent on the kind of
role being filled. The polarity and mode roles, for instance, are nearly
always filled by strings, but ARG0 and ARG1 are always filled by
instances. The quant role, which is used for representation of numerical
quantities, can be filled by an instance (e.g. for approximate quantities
such as ‘about 3’) or a string. To capture this behavior we relabel string
preterminals of the tree with labels indicating role identity and string
subsumption. This relabeling replaces, for example, one ‘X’ preterminal in
Figure 4c with “Spolarity,” as shown in Figure 4f.
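
A minimal sketch of this relabeling, assuming trees are encoded as tuples (label, child, ...) with string terminals and that each role label preterminal sits immediately before its filler. The encoding and the function name are assumptions made for illustration.

```python
def relabel_strings(tree):
    """Rename 'X' preterminals over strings to 'S' + role label."""
    label, *children = tree
    if len(children) == 1 and isinstance(children[0], str):
        return tree  # a preterminal; string fillers are renamed by the parent
    out = []
    for i, child in enumerate(children):
        is_string_filler = (
            child[0] == "X" and len(child) == 2 and isinstance(child[1], str)
        )
        if is_string_filler:
            # The preceding sibling is the role label preterminal.
            role = children[i - 1][1]
            out.append(("S" + role, child[1]))
        else:
            out.append(relabel_strings(child))
    return (label, *out)


flat_tree = (
    "X",
    ("Pfear-01", "fear-01"),
    ("PARG0", "ARG0"),
    ("X", ("Psoldier", "soldier")),
    ("Ppolarity", "polarity"),
    ("X", "-"),
)
relabeled = relabel_strings(flat_tree)
```

Instance fillers keep the label ‘X’, so after relabeling a string can no longer compete with an instance for a role such as ARG0 unless a string-filled ARG0 was actually observed.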

4.4 Tree Reordering 

Finally, let us consider the alignments between English and AMR. As is
known in SBMT, non-monotone alignments can lead to large, unwieldy rules
and in general make decoding more difficult. While this is often an
unavoidable fact of life when trying to translate between two languages
with different syntactic behavior, it is an entirely artificial phenomenon
in English-AMR. AMR is an unordered representation, yet in order to use an
SBMT infrastructure we must declare an order of the AMR tree. This means
we are free to choose whatever order is most convenient to us, as long as
we keep role label edges immediately adjacent to their corresponding role
filler edges to preserve conversion back to the edge-labeled AMR form. We
thus choose the order that is as close as possible to English yet still
preserves these constraints. We use a simple greedy bottom-up approach
that permutes the children of each internal node of the unrestructured
tree so as to minimize crossings. This leads to a 79% overall reduction in
crossings and is exemplified in Figure 4g (before) and Figure 4h (after).
We may then restructure our trees, as described above, in an
instance-outward manner. The final restructured, relabeled, and reordered
tree is shown in Figure 4i.
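
The exact greedy procedure is not spelled out here, so the following is only a sketch of one plausible bottom-up reordering: at each instance, children are grouped into units (the concept alone, or a role label together with its filler) and the units are sorted by the mean English position of the words aligned inside them. Sorting by mean position is a stand-in heuristic for crossing minimization, and the data layout, the helper names, and the word alignments in the example are all assumptions of the sketch.

```python
def leaf(tok, *pos):
    """A terminal with the English word positions it is aligned to."""
    return {"tok": tok, "pos": list(pos)}


def positions(node):
    """All aligned English positions under a node."""
    if "tok" in node:
        return list(node["pos"])
    return [p for unit in node["units"] for child in unit for p in positions(child)]


def reorder(node):
    """Bottom-up: reorder the children first, then sort this node's units."""
    if "tok" in node:
        return node
    units = [[reorder(child) for child in unit] for unit in node["units"]]

    def key(unit):
        pos = [p for child in unit for p in positions(child)]
        # Unaligned units sort last; sorted() is stable, so they keep their order.
        return sum(pos) / len(pos) if pos else float("inf")

    return {"units": sorted(units, key=key)}


def amrese(node):
    """Terminal yield of a (re)ordered tree."""
    if "tok" in node:
        return [node["tok"]]
    return [t for unit in node["units"] for child in unit for t in amrese(child)]


# 0:The 1:soldier 2:was 3:not 4:afraid 5:of 6:dying (alignments are invented)
tree = {"units": [
    [leaf("fear-01", 4)],
    [leaf("ARG0"), {"units": [[leaf("soldier", 1)]]}],
    [leaf("ARG1", 5), {"units": [[leaf("die-01", 6)],
                                 [leaf("ARG1"), {"units": [[leaf("*")]]}]]}],
    [leaf("polarity"), leaf("-", 3)],
]}
reordered = reorder(tree)
```

Moving whole units keeps every role label next to its filler, so the adjacency constraint needed for conversion back to edge-labeled AMR is never violated.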

5 AMR Language Models 

We now turn to language models of AMRs, which help us prefer reasonable
target structures over unreasonable ones.

Our first language model is unintuitively simple: we pretend there is a
language called AMRese that consists of yields of our restructured AMRs.
An example AMRese string from Figure 4i is ‘ARG0 soldier polarity -
fear-01 ARG1 die-01 ARG1 *.’ We then build a standard n-gram model for
AMRese.

It also seems sensible to judge the correctness of 
an AMR by calculating the empirical probability 
of the concepts and their relations to each other. 




[Figure 5 panels: (a) A rule extracted from the AMR tree of Figure 4c. All
roles seen in training must be used. (b) A rule from the AMR tree of
Figure 4d. Many nearly identical rules of this type are extracted, and
this rule can be used multiple times in a single derivation. (c) A rule
from the AMR tree of Figure 4e. This rule can be used independent of the
concept context it was extracted from and multiple reuse is discouraged.]

Figure 5: Impact of restructuring on rule extraction


This is the motivation behind the following model of an AMR:^5

We define an AMR instance $i = (c, R)$, where $c$ is a concept and $R$ is
a set of roles. We define an AMR role $r = (l, i)$, where $l$ is a role
label, and $i$ is an AMR instance labeled $l$. For an AMR instance $i$ let
$c_i$ be the concept of $i$’s parent instance, and $l_i$ be the label of
the role that $i$ fills with respect to its parent. We also define the
special instance and role labels ROOT and STOP. Then, we define
$P_{\mathrm{AMR}}(i \mid l_i, c_i)$, the conditional probability of AMR
instance $i$ given its ancestry as:

\[
P_{\mathrm{AMR}}(i = (c, R) \mid l_i, c_i) = P(c \mid l_i, c_i) \times
\prod_{r \in R} P_{\mathrm{Role}}(r \mid c) \times P(\mathrm{STOP} \mid c)
\]

where

\[
P_{\mathrm{Role}}(r = (l, i) \mid c) = P(l \mid c) \times
P_{\mathrm{AMR}}(i \mid l, c)
\]

We define $P(c \mid l_i, c_i)$, $P(l \mid c)$, and
$P(\mathrm{STOP} \mid c)$ as empirical conditional probabilities,
Witten-Bell interpolated (Witten and Bell, 1991) to lower-order models by
progressively discarding context from the right.^6 We model exactly one
STOP event per instance. We define the probability of a full-sentence AMR
$i$ as $P_{\mathrm{AMR}}(i \mid \mathrm{ROOT})$ where ROOT in this case
serves as both parent concept and role label.

^5 This model is only defined over AMRs that can be represented as trees,
and not over all AMRs. Since tree AMRs are a prerequisite of our system we
did not yet investigate whether this model could be sufficiently
generalized.

^6 That is, $P(c \mid l_i, c_i)$ is interpolated with $P(c \mid l_i)$ and
then $P(c)$.

As an example, the instance associated with concept die-01 in Figure 4b
has $l_i$ = ARG1 and $c_i$ = fear-01, so we may score it as:

\[
P(\text{die-01} \mid \text{ARG1}, \text{fear-01}) \times
P(\text{ARG1} \mid \text{die-01}) \times
P(\mathrm{STOP} \mid \text{die-01}) \times
P(\text{*} \mid \text{ARG1}, \text{die-01})
\]

We add the AMR LM to our system as a feature alongside the AMRese n-gram
LM.
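
The recursion in this model can be illustrated with a small sketch. It uses plain relative frequencies in place of Witten-Bell interpolation, draws role labels and STOP from a single distribution conditioned on the concept, and applies a STOP factor to every instance, including role-less ones; all three are simplifying assumptions of the sketch, as are the class and method names.

```python
import math
from collections import Counter

ROOT, STOP = "ROOT", "STOP"


class AmrLM:
    """Unsmoothed tree-structured AMR language model (illustrative only).

    An instance is a tuple (concept, [(role_label, instance), ...]).
    """

    def __init__(self):
        self.concept = Counter()      # (c, l_i, c_i)
        self.concept_ctx = Counter()  # (l_i, c_i)
        self.event = Counter()        # (role label or STOP, c)
        self.event_ctx = Counter()    # c

    def observe(self, inst, label=ROOT, parent=ROOT):
        c, roles = inst
        self.concept[(c, label, parent)] += 1
        self.concept_ctx[(label, parent)] += 1
        for l, child in roles:
            self.event[(l, c)] += 1
            self.event_ctx[c] += 1
            self.observe(child, l, c)
        self.event[(STOP, c)] += 1    # exactly one STOP event per instance
        self.event_ctx[c] += 1

    def logprob(self, inst, label=ROOT, parent=ROOT):
        # Unseen events raise an error, since there is no smoothing here.
        c, roles = inst
        lp = math.log(self.concept[(c, label, parent)]
                      / self.concept_ctx[(label, parent)])
        for l, child in roles:
            lp += math.log(self.event[(l, c)] / self.event_ctx[c])  # P(l | c)
            lp += self.logprob(child, l, c)                         # P_AMR(i | l, c)
        lp += math.log(self.event[(STOP, c)] / self.event_ctx[c])   # P(STOP | c)
        return lp


t1 = ("fear-01", [("ARG0", ("soldier", [])),
                  ("polarity", ("-", [])),
                  ("ARG1", ("die-01", [("ARG1", ("*", []))]))])
t2 = ("fear-01", [("ARG0", ("soldier", []))])

lm = AmrLM()
for t in (t1, t2):
    lm.observe(t)
p_t2 = math.exp(lm.logprob(t2))
```

With these two training trees, fear-01 has six outgoing events (two ARG0, one polarity, one ARG1, two STOP), so `t2` scores P(ARG0 | fear-01) × P(STOP | fear-01) = 1/3 × 1/3, all other factors being 1.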

6 Adding External Semantic Resources 

While we are engaged in the task of semantic parsing, we have not yet
discussed the use of any semantic resources. In this section we rectify
that omission.

6.1 Rules from Numerical Quantities and Named Entities

While the majority of string-to-tree rules in SBMT systems are extracted
from annotated data, it is common practice to dynamically generate rules
to handle the translation of dates and numerical quantities, as these
follow common patterns and are easily detected at decode-time. We follow
this practice here, and additionally detect person names at decode-time
using the Stanford Named Entity Recognizer (Finkel et al., 2005). We use
cased, tokenized source data to build the decode-time rules. We add
indicator features to these rules so that our tuning methods can decide
how favorable the resources are. We leave as future work the incorporation
of named-entity rules for other classes, since most available named-entity
recognition beyond person names is at a granularity level that is
incompatible with AMR (e.g. we can recognize ‘Location’ but not
distinguish between ‘City’ and ‘Country’).

System                                                                   Section  Tune  Test
flat trees                                                               4.1      51.6  49.9
concept restructuring                                                    4.2      57.2  55.3
role restructuring (rr)                                                  4.2      60.8  58.6
rr + string preterminal relabeling (rl)                                  4.3      61.3  59.7
rr + rl + reordering (ro)                                                4.4      61.7  59.7
rr + rl + ro + AMR LM                                                    5        62.3  60.6
rr + rl + ro + AMR LM + date/number/name rules (dn)                      6.1      63.3  61.3
rr + rl + ro + AMR LM + dn + semantic categories (sc)                    6.2      66.2  64.3
rr + rl + ro + AMR LM + dn + sc, rule-based alignments                   6.3      67.1  65.3
rr + rl + ro + AMR LM + dn + sc, rule-based + unsupervised alignments    6.3      67.7  65.8
JAMR (Flanigan et al., 2014)                                             3        58.8  58.2

Table 2: AMR parsing Smatch scores for the experiments in this work. We
provide a cross-reference to the section of this paper that describes each
of the evaluated systems. Entries in bold are improvements over the
previous state of the art. Human inter-annotator Smatch performance is in
the 79-83 range (Cai and Knight, 2013).

6.2 Hierarchical Semantic Categories 

In order to further generalize our rules, we modify our training data AMRs
once more, this time replacing the identity preterminals over concepts
with preterminals designed to enhance the applicability of our rules in
semantically similar contexts. For each concept c expressed in AMR, we
consult WordNet (Fellbaum, 1998) and a curated set of gazetteers and
vocabulary lists to identify a hierarchy of increasingly general semantic
categories that describe the concept. So as not to be overwhelmed by the
many fine-grained distinctions present in WordNet, we pre-select around
100 salient semantic categories from the WordNet ontology. When traversing
the WordNet hierarchy, we propagate a smoothed count^7 of the number of
examples seen per concept sense^8, combining counts when paths meet. For
each selected semantic category s encountered in the traversal, we
calculate a weight by dividing the propagated example count for c at s by
the frequency s was proposed over all AMR concepts. We then assign c to
the highest scoring semantic category s. An example calculation for the
concept computer is shown in Figure 8.

^7 We use very simple smoothing, and add 0.1 to the provided example
counts.

^8 Since WordNet senses do not correspond directly to PropBank or AMR
senses, we simply use a lexical match and must consider all observed
senses for that match.
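
The category selection can be sketched on a toy hierarchy. The sketch below does not query WordNet; the hypernym links, sense names, example counts, and category frequencies are invented for illustration, and the choice to credit each ancestor once per sense is an assumption about how counts are combined where paths meet.

```python
def ancestors(node, hypernyms):
    """All nodes reachable from `node` by following hypernym links."""
    seen, stack = set(), [node]
    while stack:
        cur = stack.pop()
        for parent in hypernyms.get(cur, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def choose_category(sense_counts, hypernyms, salient, category_freq, smooth=0.1):
    """Pick the salient category with the highest normalized propagated count."""
    propagated = {}
    for sense, count in sense_counts.items():
        # Each sense contributes its smoothed count to itself and every ancestor.
        for node in ancestors(sense, hypernyms) | {sense}:
            propagated[node] = propagated.get(node, 0.0) + count + smooth
    scores = {s: propagated[s] / category_freq[s]
              for s in salient if s in propagated}
    return max(scores, key=scores.get), scores, propagated


# Toy data: two senses of "computer" (a machine, and a person who computes).
hypernyms = {
    "computer.n.01": ["machine"], "machine": ["device"],
    "device": ["artefact"], "artefact": ["object"],
    "computer.n.02": ["expert"], "expert": ["person"], "person": ["object"],
}
sense_counts = {"computer.n.01": 5, "computer.n.02": 0}
salient = {"artefact", "person"}
category_freq = {"artefact": 100, "person": 200}

best, scores, propagated = choose_category(
    sense_counts, hypernyms, salient, category_freq)
```

Dividing by how often a category is proposed keeps very general categories from winning simply because almost every concept reaches them.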

We apply semantic categories to our data as replacements for identity
preterminals of concepts. This leads to more general, more
widely-applicable rules. For example, with this transformation, we can
parse correctly not only contexts in which “soldiers die”, but also
contexts in which other kinds of “skilled workers die”. Figure 6 shows the
addition of semantic preterminals to the tree from Figure 4i. We also
incorporate semantic categories into the AMR LM. For concept $c$, let
$s_c$ be the semantic category of $c$. Then we reformulate
$P_{\mathrm{AMR}}(i \mid l_i, c_i)$ as:

\[
P_{\mathrm{AMR}}(i = (c, R) \mid l_i, c_i) =
P(s_c \mid l_i, s_{c_i}, c_i) \times
P(c \mid s_c, l_i, s_{c_i}, c_i) \times
\prod_{r \in R} P_{\mathrm{Role}}(r \mid c) \times
P(\mathrm{STOP} \mid s_c, c)
\]

where

\[
P_{\mathrm{Role}}(r = (l, i) \mid c) = P(l \mid s_c, c) \times
P_{\mathrm{AMR}}(i \mid l, c)
\]

6.3 Semantically informed Rule-based Alignments

For our final incorporation of semantic resources we revisit the
English-to-AMR alignments used to extract rules. As an alternative to the
unsupervised approach of Pourdamghani et al. (2014), we build alignments
by taking a linguistically-aware, supervised heuristic approach to
alignment:

First, we generate a large number of potential links between English and
AMR. We attempt to link English and AMR tokens after conversion through
resources such as a morphological analyzer, a list of 3,235 pertainym
pairs (e.g. adj-‘gubernatorial’ → noun-‘governor’), a list of 2,444
adverb/adjective pairs (e.g. ‘humbly’ → ‘humble’), a list of 2,076
negative polarity pairs (e.g. ‘illegal’ → ‘legal’), and a list of 2,794
known English-AMR transformational relationships (e.g. ‘asleep’ →
sleep-01, ‘advertiser’ → person :ARG0-of advertise-01, ‘Greenwich Mean
Time’ → GMT). These links are then culled based on context and AMR
structure. For example, in the sentence “The big fish ate the little
fish,” initially both English ‘fish’ are aligned to both AMR ‘fish.’
However, based on the context of ‘big’ and ‘little’ the spurious links are
removed.

Figure 6: Final modification of the AMR data; semantically clustered
preterminal labels are added to concepts.

In our experiments we explore both replacing the unsupervised alignments
of Pourdamghani et al. (2014) with these alignments and concatenating the
two alignment sets together, essentially doubling the size of the training
corpus. Because the different alignments yield different target-side tree
reorderings, it is necessary to build separate 5-gram AMRese language
models.^9 When using both alignment sets together, we also use both AMRese
language models simultaneously.

^9 The AMR LM is insensitive to reordering so we do not need to vary it
when varying alignments.

7 Tuning

We would like to tune our feature weights to maximize Smatch directly.
However, a very convenient alternative is to compare the AMRese yields of
candidate AMR parses to those of reference AMRese strings, using a Bleu
objective and forest-based MIRA (Chiang et al., 2009). Figure 7 shows that
MIRA tuning with Bleu over AMRese tracks closely with Smatch. Note that,
for experiments using reordered AMR trees, this requires obtaining
similarly permuted reference tuning AMRese and hence requires alignments
on the development corpus. When using unsupervised alignments we may
simply run inference on the trained alignment model to obtain development
alignments.^10 The rule-based aligner runs one sentence at a time and can
be employed on the development corpus. When using both sets of alignments,
each approach’s AMRese is used as a development reference (i.e. each
development sentence has two possible reference translations).

[Figure 7 axis: Tuning Iteration]

Figure 7: Bleu of AMRese and Smatch correlate closely when tuning.

8 Results

Our AMR parser’s performance is shown in Table 2. We progressively show
the incremental improvements and compare to the current state-of-the-art
system of Flanigan et al. (2014). Purely transforming AMR data into a form
that is compatible with the SBMT pipeline yields suboptimal results, but
by adding role-based restructuring, relabeling, and reordering, as
described in Section 4, we are able to surpass Flanigan et al. (2014).
Adding an AMR LM and semantic resources increases scores further.
Rule-based alignments are an improvement upon unsupervised alignments, but
concatenating the two alignments yields the best results.

^10 In practice, since the corpus is not too large and the released
implementation by Pourdamghani et al. (2014) is based on the
multi-threaded mGIZA++ (Gao and Vogel, 2008), we align twice, once with
just the training data, and once with training and development data. We
use the training-only run for rule extraction and only the development
part of the training+dev for tuning.


















Figure 8: WordNet hierarchy for computer. 
Pre-selected salient WordNet categories are 
boxed. Smoothed sense counts are propagated 
up the hierarchy and re-combined at join points. 
Scores are calculated by dividing propagated 
sense count by count of the category’s prevalence 
over the set of AMR concepts. The double box 
indicates the selection of artefact as the category 
label for computer. 


9 Related Work 

The only prior AMR parsing work we are aware of is that of Flanigan et al.
(2014). In that work, multiple discriminatively trained models are used to
identify individual concept instances and then a minimum spanning tree
algorithm connects the concepts.

Several other recent works have used a machine translation approach to
semantic parsing, but all have been applied to domain data that is much
narrower and an order of magnitude smaller than that of AMR, primarily the
Geoquery corpus (Zelle and Mooney, 1996). The WASP system of Wong and
Mooney (2006) uses hierarchical SMT techniques and does not apply
semantic-specific improvements. Andreas et al. (2013) use phrase-based and
hierarchical SMT techniques on Geoquery. Like this work, they perform a
transformation of the input semantic representation so that it is amenable
to use in an existing machine translation system. However, they are unable
to reach the state of the art in performance. Li et al. (2013) directly
address GHKM’s word-to-terminal alignment requirement by extending that
algorithm to handle word-to-node alignment.

Earlier work on using machine translation techniques for semantic parsing
includes that of Papineni et al. (1997). That work applies the IBM machine
translation models (Brown et al., 1993) to English paired with a
non-structural formal language of air travel queries. In a similar vein,
Macherey et al. (2001) use IBM models and their Alignment Template
approach to analyze a relatively large corpus of German train scheduling
inquiries. The formal language they generate is not structural and has a
vocabulary of less than 30 types; it may thus be seen as an instance of
semantic role labeling rather than semantic representation parsing.

Our SBMT system is grounded in the theory of tree transducers, which have
also been applied to the task of semantic parsing by Jones et al. (2011;
2012).

Semantic parsing in general and AMR parsing specifically can be considered
a subsumption of multiple semantic resolution sub-tasks, such as named
entity recognition (Nadeau and Sekine, 2007), semantic role labeling
(Gildea and Jurafsky, 2002), word sense disambiguation (Navigli, 2009) and
relation finding (Bach and Badaskar, 2007).

10 Conclusion 

By restructuring our AMRs we are able to convert a sophisticated SBMT
engine into a baseline semantic parser with little additional effort. By
further restructuring our data to appropriately model the behavior we want
to capture we are able to rapidly achieve state-of-the-art results.
Finally, by incorporating novel language models and external semantic
resources, we are able to increase quality even more. This is not the last
word on AMR parsing, as fortunately, machine translation technology
provides more low-hanging fruit to pursue.